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Abstract 

This work provides explicit characterizations and formulae for the minimal polynomials of 
a wide variety of structured 4x4 matrices. These include symmetric, Hamiltonian and 
orthogonal matrices. Applications such as the complete determination of the Jordan structure 
of skew-Hamiltonian matrices and the computation of the Cayley transform are given. Some 
new classes of matrices are uncovered, whose behaviour insofar as minimal polynomials are 
concerned, is remarkably similar to those of skew-Hamiltonian and Hamiltonian matrices. 
The main technique is the invocation of the associative algebra isomorphism between the 
tensor product of the quaternions with themselves and the algebra of real 4x4 matrices. 



1 Introduction 

The minimal polynomial of a matrix is the unique monic polynomial of minimal degree which 
annihilates the matrix. It has several theoretical and practical uses. It provides information 
about the Jordan structure of the matrix, and in some situations can nearly determine it. Its 
principal utility, arguably, is in computing functions of a matrix such as the matrix exponential 
and the Cayley transform. While any annihilating polynomial can be used for this purpose, 
the complexity of the resultant expression is naturally minimal when the minimal polynomial 
is used. 

If one knows the Jordan structure of the matrix then its minimal polynomial is easily com- 
puted. However, since the former is difficult to arrive at, this is rarely advisable. Essentially 
any mechanism which explicitly detects linear dependence at the earliest stage in the matrices 
/ , A, A 2 , . . . , A n (in that order) will yield the minimal polynomial, [7, 8]. In this work we use 
quaternions to achieve the same for specific structured 4x4 matrices. Whilst, 4x4 matrices 
are amenable to the techniques of [7, 8], the corresponding calculations can be quite diffi- 
cult, and would not produce the closed form expressions for minimal polynomials presented 
herein. More discussion on this issue is presented in Section 5. In the method proposed here 
one replaces matrix calculations (specifically computing A k ) via quaternion calculations. Not 
only does this simplify such calculations, but it also yields elegant geometric interpretations 
of situations wherein the minimal polynomial is a particular polynomial. Of course, this 
methodology does not extend to higher dimensions immediately (see, however, the discussion 
in Section 5), but 4x4 matrices already cover several important applications to warrant 
the investigation of such a technique. In quantum computation, quantum optics, computer 
graphics, robotics etc., much of the analysis is reducible to the study of 4 x 4 matrices [see, 
for instance, [3, 4, 16]]. 

The isomorphism of H (g) H with M(4, R) is a central point in the theory of Clifford alge- 
bras. So a natural question is whether Clifford algebra isomorphisms can be used for similar 
purposes. In fact, the interesting work of [1] uses the symbolic and numerical computation 
the (real) minimal polynomial of matrices via their Clifford algebra representatives, for ex- 
ponentiation of matrices. The difference between the work of [1] and the results here, insofar 



as the problem of computation of minimal polynomials of matrices in M{A,R) is concerned, 
is that the structural (i.e., "geometrical") conditions given here on the entries of a matrix 
for it to possess a given minimal polynomial are missing in [1]. There are of course other 
differences. Section 5 discusses this issue briefly. 

It is appropriate at this point to record some history of the linear algebraic applications 
of the isomorphism between H (g> H and M 4 (i?). This isomorphism is central to the theory 
of Clifford algebras, [10]. However, it is only relatively recently been put into use for linear 
algebraic purposes. To the best of our knowledge, the first instance seems to be the work 
of [9], where it was used in the study of linear maps preserving the Ky-Fan norm. Then 
in [6], this connection was used to obtain the Schur canonical form explicitly for real 4x4 
skew-symmetric matrices. Next, is the work of [5, 11, 12], wherein this connection was put 
to innovative use for solving eigenproblems of several classes of structured 4x4 matrices. 
In [14, 15], this isomorphism was used to explicitly calculate the exponentials of a wide 
variety of 4 x 4 matrices. Finally, in [2] it was used to obtain, among other things, the polar 
decomposition of 4 x 4 symplectic matrices via the solution of 2 x 2 linear systems of equations. 

The balance of this paper are organized as follows. In the next section basic notation and 
preliminary facts are reviewed. The next section contains all the main results on minimal 
polynomials obtained by our method. Since many of the proofs are similar we provide proofs 
for only a part of the announced results. The fourth section contains three applications. 
The first is to the complete determination of the Jordan structure of 4 x 4 skew-Hamiltonian 
matrices. The second illustrates the usage of the results on minimal polynomials to calculate 
the Cayley transform in closed form. The final application is to the determination of the 
singular values of 3 x 3 real matrices. The next section discusses extension of the results of 
section 3 via the use of Clifford Algebras. In particular, classes of matrices are uncovered which 
behave very similar to skew-Hamiltonian matrices insofar as minimal polynomial matrices are 
concerned. Their block structures do not suggest this similarity. This section also provides a 
brief comparison of our technique with that of [1, 8]. The final section offers some conclusions. 



2 Notation and Preliminary Observations 

The classes of real matrices discussed in this work are as follows: 

• Skew-symmetric matrices, i.e., X satisfying X T = —X. 

• Hamiltonian matrices, i.e., matrices H satisfying H T J 2n = —J2 n H, where J 2n = 

( o I \ 

^ — In 0„ 

• Perskewsymmetric matrices, i.e, matrices X satsifying X T R n = —R n X, where R n is the 
n x n matrix containing Is on its main anti-diagonal and Os elsewhere. R n is sometimes 
denoted F and is called the flip matrix. 

• For a matrix X <G M(n,R), we denote by Xp, its adjoint with respect to the non- 
degenerate bilinear form defined by R n , i.e., it is the matrix R n X T R n . 

• For a matrix X 6 M(2n,R), we denote by Xh, its adjoint with respect to the non- 
degenerate bilinear form defined by J„, i.e., it is the matrix —Ji n X T J2n- 

• Symmetric matrices, i.e., X with X T = X. 

• Skew-Hamiltonian matrices, i.e, matrices X satisfying X T J 2n — JinX. 

• Special orthogonal matrices, i.e., X with X T X = XX T = I n and det(X) = 1. 

These classes were picked because i) they are ubiquitous in applications; and ii) in most 
cases, as will be seen subsequently, elegant geometric conditions on their quaternionic repre- 
sentations can be given which ensure their possessing a certain minimal polynomial. Matrices 
such as persymmetric and symplectic matrices do not seem that amenable by the latter con- 
sideration, and therefore are not considered here. We note, however, that in the final section 
we discuss how quaternion techniques can be used to compute the minimal polynomial of a 
general matrix in M(4, R). 

Definition 2.1 H stands for the real division algebra of the quaternions. V stands for the 
purely imaginary quaternions. 

We will tacitly identify an element of V with the corresponding vector in R 3 . With this 
understood, the following two identities will be frequently used. 



• Let p,q £P. Then pq = — {p.q)l + p x q. 

• Let p,q,r £ R 3 . Then 

p x (q x r) = (p.r)q — (p.q)r (2.1) 

H ® H and g>Z(4, i?): The algebra isomorphism between H ®H and M 4 (i?) (also denoted by 
gl(4,R)), which is central to this work, may be summarized as follows: 

• Associate to each product tensor p® q £ H ® H, the matrix, M p ® q , of the map which 
sends x £ H to pxq, identifying R A with H via the basis {1, i,j, k}. Here, q = q^ — q\i — 
123 -q$k 

• Extend this to the full tensor product by linearity. This yields an associative algebra 
isomorphism between H®H and M 4 (i?). Furthermore, a basis for g/(4, R) is provided by 
the sixteen matrices M ex< ^ ey as e x , e y run through 1, k. In particular, i? 4 , the matrix 
intervening in the definition of perskewsymmetric matrices, and J 4 , the matrix used in 
the definition of Hamiltonian and skew-Hamiltonian matrices, represented respectively, 
by Mj®i and Mi®j, belong to this basis. 

Quaternion Representations of Special Classes of Matrices: Throughout this work, 
the following list of H ® H representations of the above classes of matrices will be used: 

• Skew-Symmetric Matrices: s <g> 1 + 1 ® t with s, t £ V . 

• Hamiltonian Matrices: 6(1 ®j)+p®l + q®i + r<g>k, with b £ 1Z and p,q,r £ V . 

• Perskewsymmetric Matrices: r($i + j!%>s + a(l ® i) + (3(j ® 1), with r,s £ V and 
a, /3 £ 11. 

• Symmetric Matrices: a\®l+p®i + q®j + r®k, with a £ TZ and p,q,r £ V . 

• Skew-Hamiltonian Matrices: 6(1 <S> 1) +p (8> j + 1 ® (ci + dfc), with b,c,d £lZ and p £V . 

• Special Orthogonal Matrices: u®v, with u,w unit quaternions, i.e., || w || = || w |j= 1. 

These can be easily obtained from the entries of the 4x4 matrix in question (see [11, 5, 12] 
for some instances). The key to this consists of the following two observations 



• Conjugation in H ® H corresponds to matrix transposition in gl(4,R), i.e., Mp^q = 
(M p ®q) T . This is why, for instance, symmetric matrices correspond to c(l ® 1) + p® 
i + q®j + r®k with p, g, r purely imaginary, and skew-symmetric matrices correspond 
to s <g> 1 + 1 (g) i, s,i £ P. 

• Hamiltonian (resp. skew-Hamiltonian) matrices are expressible as J2 n S, with S sym- 
metric (resp. skew-symmetric). Similarly persymmetric (resp. perskew-symmetric) 
matrices are expressible as R n S with S symmetric (resp. skew-symmetric). Thus, for 
instance, perskewsymmetric matrices are represented by (j®i)[s®l+l®f], s,t £P. This 
simplifies to p®i + a(j® l)+j®g+/3(l ®i) withp G span {i,k},q G span {j, k},a,(3 E R. 
If such a matrix is simultaneously symmetric, then a = /3 = 0, etc., 

Combining these two observations with the explicit forms of the sixteen matrices, M e ^ ey 
leads to H ® H representations, in terms of the entries of the matrices. For the first five 
classes, the expressions for the H ® H representations are linear in the entries of the matrix. 
Sec [11, 5, 12] for these expressions. For special orthogonal matrices, the entries of the matrix 
are quadratic in u and v. See [4] for an algorithmic determination of the unit quaternions u 
and v from the entries of a special orthogonal matrix. 

By way of illustration, the requisite expression for the quaternionic representation for a 
skew-Hamiltonian matrix is provided below. 

Proposition 2.1 [5] Let X be a skew-Hamiltonian matrix. Its H ® H representation is 
6(1 ® 1) + p ® j + 1 (8) (ci + dk), with b, c, d e R and p a purely imaginary quaternion. 
The formulae relating these to X's entries are as follows: i) b = 5(^11 + -^22); h) P = 
i[(X 32 - X u ), (X u - X22), (X 12 + X 21 )]- iii) c = \(X 12 - X 2 i); iv) d = |(X 14 + X 32 ). 

We close this section with the notion of the reverse of a polynomial. 

Definition 2.2 If p(x) = Ym=q a i x% i s a polynomial of degree n, then its reverse is the 
polynomial pr ev (x) = Y^i=o a ^-i xl - 



3 Minimal Polynomials of Classes of 4 x 4 Matrices 

We begin with a simple proposition, applicable in arbitrary dimensions, which reduces the 
list of possible minimal polynomials for some of the matrices to be considered here. 

Proposition 3.1 • I) Let A T be similar to —A. If the degree of the minimal polynomial 
of A is even, then its minimal polynomial is an even polynomial. If the degree of the 
minimal polyomial is odd, then it is an odd polynomial. 

• II) Let A^ 1 be similar to A T . Then the constant term in its minimal polynomial is 
either +1 or —1. If it is the former, then its minimal polynomial equals its reverse. If 
it is the latter it is minus its reverse. 

Proof: I) Let Qa(x) = x k + Y^lZo aixl ^ e tne mm imal polynomial of A (and thus of A T ). 
Then clearly the polynomial p(x) = x k + J2i=a(~ ^Y^iX 1 annihilates the matrix —A. So, p(x), 
which is monic, has to be the minimal polynomial of —A. Indeed, if ( x ) = x l + Si=o hx l , 
was the minimal polynomial of — A (with I < k) then x l + X^=o( — ^) l biX l annihilates A, 
contradicting the minimality of Qa{x). Thus, since — A and A T are similar, Qa(^) = p(x), and 
the result follows. 

II) Let qA(x) = x k + J2i=o a i x% be the minimal polynomial of A. Since A is invertible 
a 7^ 0. Then, we find that p(x) = j^{x k + J2i=o a k-iX l ) annihilates A" 1 . Now, by an 
argument similar to I), p(x) has to be the minimal polynomial of A^ 1 . The similarity of A T 
and A^ 1 then implies the equality of p(x) and qA (x). This forces do to be either 1 or —1. This 
then means p(x) equals its reverse when a = 1 or it equals minus its reverse when ao = — 1, 
and the result follows. 

Remark 3.1 Note that even if det(A) = 1, for matrices as in II) of Proposition (3.1), it is 
possible that a = —1, and thus the minimal polynomial equals minus its reverse. This is 
in sharp contrast with the characteristic polynomial of such an A, which always equals its 
reverse. 

Proposition (3.1) shortens the list of possible minimal polynomials for skew-symmetric, 
Hamiltonian, perskewsymmetric and special orthogonal matrices, since in each of these cases 
A T is similar to either —A or A^ 1 . When A T is similar to A, there are no such general results. 



Next follow our main results about minimal polynomials. As mentioned in Section 1, we 
detail only those cases where one has an "elegant" condition on the H ® H representations 
of the matrix in question which is equivalent to the matrix having the said polynomial as 
its minimal polynomial. This already contains an extensive collection of useful matrices. 
Furthermore, since the proofs are similar, we present details only for some cases. 

Theorem 3.1 Minimal Polynomials of Antisymmetric Matrices: Let S be antisymmetric, 
with representation s ® 1 + 1 <8> t. Then, 

• S has a quadratic minimal polynomial, which equals x 2 + A 2 , iff precisely one of s or t 
is equal to zero. Furthermore, in this case, A 2 is either s.s or t.t. 

• S has a cubic minimal polynomial, which equals p{x) = x 3 + (A 2 + 2l)x, iff 



Furthermore, in this case, A 2 = s.s + t.t and I =| s || 2 . 

• If none of the above conditions hold, the minimal polynomial is the characteristic poly- 
nomial which equals p(x) — x 4 + 2X 2 x 2 — (4/ 2 — A 4 ), with A and I as above. 

Theorem 3.2 Minimal Polynomials of Hamiltonian Matrices: Let H be Hamiltonian with 
representation 6(1 <g> j) + p <g> 1 + q ® i + r ® k. Then, 

• H has a quadratic minimal polynomial, which equals p(x) = x — u>, with cu = —b 2 — 
p.p + q.q + r.r, iff p.r = p.q = and r x q = —bp. Notice, if b ^ 0, then the first two 
conditions are subsumed by the last condition. 

• H has a cubic minimal polynomial, which equals p(x) = x 3 — (to + 2k)x 1 with uj as in 
the quadratic minimal polynomial case and k as specified below, iff one of the following 
five mutually exclusive conditions hold [See Remark (3.2), below, for special cases of 
these conditions]. 

1.6^0 and the matrix G — X T X, with X = [p | q \ r], has the the matrix 

( U2 , \ 

b + k —p.q —p.r 
Y = —p.q r.r — k —q.r 
—p.r —q.r q.q — k 




with k — ^\p.(qxr) — b(p.p)], as its "near inverse". Specifically GY = b\p.(q x r)]Is, 
In this case the coefficient k in the given cubic minimal polynomial is \ [p. (qxr) — 
bp.p}. 

2. b — 0, r x q ^ 0, p = r.q = and q.q = r.r. In this case k = r.r + q.q 

3. b = 0, r x q ^ 0, p ^ 0, p.(q x r) = 0, p.q = 0, r.q = 0, p.r ^ and (r.r) 2 + (p.r) 2 = 
(q.q)(r.r). In this case A: = r.r. 

4. 6 = 0, r x q ^ 0, p ^ 0, p.r = 0, q.r = 0, p.<? ^ 0, and (g.g) 2 + [p.q) 2 = (q.q)(r.r). 
In this case, fc = (j.q. 

5. b = 0, r x q ^ 0,p ^ 0, p. (<j x r) = 0, p.q 7^ ^ p.r and the following four quantities 
are all equal to — |^(p.r), 

11 112 f, v n M 2 ,v-q, \ I, 1,2 [(q-p) 2 + (q- r ) 2 } I, 1,2 I( r --P) 2 + {i- r f 

Nil + — (r.«),||r II + — (r.q), II r || — ~ 2 ,\\q\\ — 

In this case k — — j^(p.r). 

• If none of the above conditions hold, the minimal polynomial is the characteristic poly- 
nomial, which equals 

p(x) = x 4 - 2lux 2 - (4b 2 \\p\\ 2 +8bp.(r x q) + 4 1 1 r x q \ \ 2 -lu 2 - 4(p.q) 2 - 4(p.r) 2 ) 

Theorem 3.3 Minimal Polynomials of Perskewsymmetric Matrices: Let P be a perskewsym- 
metric matrix with representation r®i+j®s+ a(l ® i) + (3(j ® 1). Then, 

• P has a quadratic minimal polynomial iff one of the following three mutually exclusive 
sets of conditions hold. These are: \)a = 0, ^ 0, s = 0; ii)/3 = 0, a ^ 0, r = 0; hi) 
a = f3 = and either r x j = or s x i = 0. In each of these cases the minimal 
polynomial is x 2 — A 2 , with A 2 = r.r + s.s — a 2 — /3 2 . 

• P has a cubic minimal polynomial iff a 2 — /3 2 = s.s — r.r (without any of the conditions in 
the quadratic minimal polynomial case occurring). In this case the minimal polynomial 
is x 3 - (A 2 +2a 2 - 2(s.s))x. 

• If none of the above conditions hold the minimal polynomial is the characteristic poly- 
nomial which equals 

p(x) = x 4 - 2X 2 x 2 - [4/3(s.s) - Aa{r.r) + 4a 2 /3 2 - 4 1 1 r x j \ \ 2 . \ \ s x i \ \ 2 - A 4 ] 



Sketch of the Proof: We will illustrate the calculations involved by proving the conditions 
for quadratic and cubic minimal polynomials for a Hamiltonian matrix H . 
Quadratic Case: H 2, s quaternionic representation is 

H 2 = (-b 2 -p.p + q.q + r.r)(l ® 1) + 2(r x q + bp)®j - (2p.q)l®j - (2p.r)l <g> k 

According to Proposition (3.1) if at all H 2 is linearly dependent on a lower power of H, that 
power has to be 1 ® 1. A necessary and sufficient condition for that to happen is evidently 

r x q = —bp,p.q = 0,p.r = 

Clearly, if b ^ 0, this set of conditions is equivalent to bp = q x r. If these conditions hold the 
minimal polynomial of H is x 2 — u>, with 

lu = —b 2 — p.p + q.q + r.r 

Cubic Case: By a direct calculation, which makes copious use of the vector triple identity 
[Equation (2.1)], one finds that 

H 3 = ujH + 2[p.(q x r) — bp.p]l ® J + 2[—b 2 p + b(q x r) + (p.q)q + (p.r)r] 1 
+ 2[— (p.q)p — b(r x p) + (r.r)q — (r.q)r] <g) i 

+ 2[-{p.r)p - b(p x q) + (q.q)r - (q.r)q)} (g) k 

In view of Proposition (3.1), for H to have a cubic minimal polynomial, therefore there 
has to be a real k such that 

\p.(q x r) - bp.p] = kb (3.2) 

and further that 

(b 2 + k)p - (p.q)q - (p.r)r = b(q x r) (3.3) 

— (p.q)p+(r.r — k)q—(r.q)r = b(r X p) 

-(p.r)p - (q.r)q + (q.q - k)r = b(p x q) 

When this happens, in absence of the conditions for a quadratic minimal polynomial, the 
minimal polynomial of H is 

p(x) = x 3 - (w + 2k)x 

There are now two possibilities. 



• 6 ^ 0, or 

• 6 = 0. 

In the former case, we find 

k = -r\p-(q x r) — bp.p] 
b 

Next, noting that G is the Gram matrix of X = [p \ q \ r], one finds that taking the inner 
product on both sides of Equation (3.3) successively with p, q, r yields 

GY = b[p.(q x r)]/ 3 

Conversely if GY = b\p.(q x r)]Is, one easily obtains Equation (3.3). For instance, if we denote 
by v the vector (b 2 + k)p — (p.q)q — {p-r)r, then, in view of the first column of Y, we find 
v.r = = v.q, and hence v is proportional to q x r. The remaining entry in this column of Y 
confirms that v is indeed b(q x r). Hence H has the stated minimal polynomial. 
Now suppose, 6 = 0. Then we first need r x q ^ 0, for otherwise we would have 

2{bp+ (r xq))=0 

which corresponds to the quadratic minimal polynomial case. Further, under the condition 
6 = 0, Equations (3.2) and (3.3) reduce to 

p.(?xr)=0 (3.4) 

and 

kp — (p.q)q — (p.r)r = (3.5) 

— {p.q)p + (r.r — k)q — (r.q)r = 

— (p.r)p — (q.r)q + (q.q — k)r = 

Now the analysis of the conditions equivalent to H having the stated minimal polynomial 
may be divided into two further cases: 

• p = 0, or 

• p^Q. 



Suppose first that p is zero. Then Equation (3.4) and the first Equation in the system 
(3.5) are trivially satisfied, while the remaining two equations of Equation (3.5) yield 

(r.r — k)q = (r.q)r 
(q.q - k)r = (q.r)q 

These two equations contradict r x q ^ 0, unless r.r = q.q = k and q.r = 0. Conversely these 
two conditions are trivially sufficient to ensure that H has the said cubic minimal polynomial 
when p = = b. 

Next, suppose p^O. Then certainly the linear independence of q and r, and the linear 
dependence of p on them is required. Now at least one of p.q or p.r is not zero, for otherwise 
p becomes zero, contradicting the starting assumption for this case. 

Now the analysis may be divided into three cases: 

• p.q = 0, but p.r ^ 0. Then taking the inner product of the first equation in (3.5) 
with q forces r.q = 0. Next, by taking the inner product of the same equation with 
r yields k — r.r. The second equation also forces k = r.r. The third equation is 
trivially satisfied upon taking inner product with q, while taking inner product with 
r forces k — is^iiLLt^Li , Hence, we necessarily require (r.r) 2 + (p.r) 2 = (q.q)(r.r). 
Conversely, if these conditions hold, then the vectors formed by the left hand sides of 
Equation (3.5), which are in the span of q and r, are by construction orthogonal to q 
and r. Hence they must be zero. Thus, Equations (3.4) and (3.5) are satisfied and hence 
H has the stated cubic minimal polynomial. 

• p.r = and p.q ^ 0. Then, by an argument similar to the one above, a necessary and 
sufficient set of conditions are given by i) q x r ^ 0; ii) p.(q x r) = 0; iii)r.g = 0; iv) 
(q.q) 2 + (p.q) 2 — (q.q)(r.r). In this case, k = q.q. 

• Neither p.q nor p.r is zero. Then, first by taking inner product with q of the last equation 
of the system of (3.5), for instance, one sees that r.q ^ 0. Next taking the inner product 
with respect to q, first and then with respect to r of all equations in the system (3.5), 
one arrives at six possible expressions for k. Of these two are already equal to —^(p.r). 



The remaining four arc 

Nil 2 + P ~(r. q ), II r || 2 || r || 2 -fe^fl, || 9 || 2 _ El^|^ 

p.g p.r || q \\ z \\ r \\ z 

Hence necessarily these four quantities are equal to each other and to — |^(p.r). Con- 
versely, these conditions are sufficient to ensure that H has the said cubic minimal 
polynomial with k — — ^(p.r). 



Remark 3.2 There are some special cases of the above result for the stated cubic minimal 
polynomial for a Hamiltonian matrix H, which deserve mention. 

• First if b 0,p ^ 0, and p,q,r are collinear, then H has the given cubic minimal 
polynomial iff 

b 2 = p.p + q.q + r.r 

In this case k — —p.p. Indeed, in this case both the matrices G and Y are rank one 
matrices, and the condition GY = then is equivalent to b 2 = p.p + q.q + r.r. 

Note this contains the special case that q = r = 0. In this case, H is also skew- 
symmetric, and we find that a necessary and sufficient condition for H to have the 
given polynomial as its minimal polynomial is p.p = b 2 . This, as is easily seen, is 
in keeping with the conditions for a skew-symmetric matrix to have a cubic minimal 
polynomial. 

• A second special case, diametrically opposed to the previous one, occurs when the 
vectors p, q, r are all non-zero, and satisfy q x r = ap, r x p = (3q,p x q = jr, for some 
non-zero real numbers a, f3, 7. One then finds that a ^ b (for otherwise, we would have 
a quadratic minimal polynomial). In this case k = (a — b)b and G and Y are both 
diagonal. Then the condition GY = bp.(q x r)I 3 is equivalent to /? = 7 (equivalently 
q.q = r.r) and 

b(3 2 + (a- b)b -ab 2 = 
These conditions are satisfied if, for instance, 6 = /3 = 7, a ^ b and p.p = b 2 . 

• Note when b = = p, H is a symmetric, Hamiltonian matrix. The conditions stated 
above for a cubic minimal polynomial for H also follow from Theorem (3.5) below. 



Remark 3.3 Note that there is an asymmetry in the role of p (vis a vis q, r) in the matrix 
Y intervening in the conditions for a cubic minimal polynomial for a Hamiltonian matrix H . 
This is not surprising since p stems from the anti-symmetric part of H, while q, r stem from 
the symmetric part of H . 

Next we study minimal polynomials for skew-Hamiltonian and symmetric matrices. Now 
Proposition (3.1) does not apply. Nevertheless we will find that the former always have 
quadratic minimal polynomials, and this is an illustration of the utility of quaternions. For 
the latter, in order to minimize bookkeeping, we suppose they are traceless. Once the minimal 
polynomial of these are found, those of symmetric matrices with non-zero trace are easily 
found. 

Theorem 3.4 Minimal Polynomials of Skew-Hamiltonian Matrices: Let W be skew-Hamiltonian 
with representation b{l®l)+p®j + \®(ci+dk) . Then W has a quadratic minimal polynomial, 
which equals p{x) — x 2 — 2bx + k, with k = b 2 — \ \ p || 2 +c 2 + d 2 . . 

Theorem 3.5 Minimal Polynomials of Symmetric Matrices: Let S be a non-zero, traceless 
symmetric matrix with representation p <g> i + q® j + r ® k. Then 

• i) S has the quadratic minimal polynomial p{x) = x 2 — X 2 iff the rank of X = [p, q, r] is 
one. In this case A 2 = (p.p + q.q + r.r). 

• ii) S has the quadratic minimal polynomial p(x) = x 2 — 2lx — A 2 iff p x q = Ir, q x r = 
lp,r x p = Iq, for the same non-zero / and A ^ 0, as in i) above. 

• S has the cubic minimal polynomial p(x) = x 3 — (A 2 + 2a)x iff the rank of X is two and 
one of the three following mutually exclusive conditions hold: 

— Upto cyclic permutations of p,q,r, p.q = 0, rxp = = qx r,p.p = q.q. In this 
case a = p.p. 

— Upto cyclic permutations of p, q and r, one has p x q — 0,r x p ^ 7 q.r = 0, p.p + 
q.q = r.r. In this case a = r.r (note the conditions p x q = and q.r = imply 
r.p = also). 

— None oi p x q, q x r or r x p is zero and the following set of equalities holds 



r 



2 



[(r.p)(r.q)] 




[(q.p)(r.q)] 




[(r.p)(p.q)] 



p.q 



p.r 



q.r 



In this case a =|| p || 2 -iiLEiisM. 

ii i ii q.r 

• When the degree of the minimal polynomial is four, the minimal (and characteristic) 
polynomial is p{x) = x 4 - 2X 2 x 2 - 8(p.(q x r))x + [A 4 - 4(|| q x r \\ 2 + \\ r x p || 2 + || 
pxq || 2 ]. 

Note: In the case of symmetric matrices, there are other cubic minimal polynomials. Ex- 
pressions and conditions for them can be found, but they do not have elegant geometric 
interpretations, and so we omit them. 

Sketch of the proof: Once again we illustrate the quadratic and cubic minimal polynomial 
case for traceless, symmetric matrices. One first finds that S 2 is given by 

S 2 = (II P II 2 + II q II 2 + II r || 2 )1 ® l + 2(p xq)®k + 2{qx r) ® i + 2(r X p) <g> j 

Clearly then p(x) = x 2 — A 2 can annihilate Siffpxq = qxr = rxp = 0, i.e., iff [p, q, r] has 
rank one. When this holds A 2 = p.p + q.q + r.r. 

Similarly, p{x) = x 2 — 2lx — A 2 annihilates S iff p x q = Ir, q x r = Ip, r x p = Iq for the 
same non-zero I. When this happens A 2 has to be necessarily p.p + q.q + r.r. 

Next a calculation shows that 

S 3 = \ 2 S + 6p.{q x r)l®l 

+2[q x (p x q) - r x (r x p) <g> i 

+2[r x (q x r) - p x (p x q)] ® j + 2[p x (r x p) - q x (q x r)] <g> k 

It follows that for S to have the desired minimal polynomial one needs p.(q x r) — and that 
the following conditioning all cyclic permutations of it, have to hold 

(q x p) x q + (r x p) x r = ap (3.6) 

for the same non-zero a. 

The condition p.(qxr) =0 forces the rank of X = [p, q, r] to be atmost two. It has to be 
two, since the rank one case corresponds to a quadratic minimal polynomial. Hence rank of 
X T X = 2. Since X T X is positive semidefinite, at least one principal minor of order two has 
to be non-zero. Hence further analysis can be divided into three mutually exclusive cases: 



• Precisely one 2x2 principal minor of X T X is non-zero - say the one corresponding to 
the pair (p, q). Thus rxp=0 = gxp, but p x q ^ 0. So the system (3.6) reduces to 

q x (p x q) = ap; (p X q) x p = aq; ar = 

Hence, the linear independence of p, q first forces p.q = and q.q = p.p = a. This 
implies a^O and hence r = 0. Conversely these conditions arc sufficient for S to have 
the stated minimum polynomial. 

• Precisely two of the 2x2 principal minors of X T X are zero, say those corresponding to 
the pairs (r,p) and (q,r). In particular, p x q = 0. Writing out the system (3.6) under 
these assumptions, we find that 

r.p = 0, a = r.r, r.q = 0, a = p.p + q.q 

So the stated conditions are necessary and it is easy to see their sufficiency as well. 

• None of the 2x2 principal minors of X T X are zero. Thus each of the pairs (p, q), (q, r) 
and (r,p) are linearly independent, but each of the three vectors is linearly dependent 
on the remaining two. Then the system (3.6) is equivalent to 

(q.q + r.r - a)p = (q.p)q + {r.p)r (3.7) 
(r.r + p.p — a)q = (q.r)r + (q.p)p 
(p.p + q.q - a)r = (r.p)p + (r.q)q 

Then the first equation in the last system says q.p ^ 0. Indeed, if q.p — 0, then the 
linear independence of the pair of vectors (r,p) would force r.p = as well. But then, 
p being linearly dependent on the pair (q, r) would have to be zero. Similarly r.p ^ 
and q.r 0. 

Now successively taking the inner product of the first equation in the above system with 
q, r; of the second equation with r,p; and the third equation with p, q, yields six possible 
expressions for a. Of these three are identical. Thus, we have three distinct expressions 
for a, which have therefore got to coincide, i.e., it is necessary that 

rr [(r-P)(r.q)] =qq _ [(g-P)( r -g)l [( r -P)(<l-P)] 
p.q p.r ' q.r 



Conversely if the above equalities hold, then the vectors represented by the left hand 
sides of the system (3.7) are equal to the corresponding right hand sides. Hence these 
conditions are necessary and sufficient for S to have the stated cubic minimal polyno- 
mial. 

Remark 3.4 It is not enough for X = [p, q, r] to have rank 2 for a symmetric S to have the 
stated cubic minimal polynomial. The remaining conditions are needed. In fact, it turns out 
that all other cases when X has rank two correspond to fourth degree minimal polynomials. 

We next consider matrices in SO(A,R). II) of Proposition (3.1) applies to such matrices. 

Theorem 3.6 Minimal Polynomials of Special Orthogonal Matrices: Let G^I,—I be rep- 
resented by u ® v. Let u = Re(u), v = Re(v). Then, 

• G has minimal polynomial x 2 — 1 iff u = v a = 0. 

• G has minimal polynomial x 2 + ax + 1 iff either (but not both) lm(u) = or lm(v) = 0. 
In this case, a = —2vq (rcsp. — 2uq). 

• G has minimal polynomial x 3 — ax 2 + ax — 1 iff u = v a / 0. In this case a = 4u t>o — 1- 

• G has minimal polynomial x 3 + ax 2 + ax + 1 iff u = —v a ^ 0. In this case a — 
-(l + 4u v ). 

• If none of the above conditions hold, G's minimal polynomial is its characteristic poly- 
nomial which equals x A + ax 2 + bx 2 + ax + 1, with a = —AuqVo, b = 4uq + 4vq — 2. 

Sketch of the proof: First, since G is neither / nor —I, lm(u) and lm(v) cannot be 
simultaneously zero. 

Next, using u 2 = (2u§ — 1) + 2u lm(u) (and a similar expression for v 2 ), we see G 2 is 
represented by 

(2^-l)(2^-l)(l®l)+2« (2uo-l)(I^W®l)+2u (2^-l)(l(8)Im(u))-|-4uot;o(Irn(u)®Im(u)) 

In view of Proposition (3.1) the only possible candidates for a quadratic minimal polynomial 
are p(x) = x 2 — 1 and p{x) = x 2 + ax + 1. 

For G 2 = I, it is necessary and sufficient that all of the following to hold: 



. (2u 2 - \){2v 2 - 1) = 1. 

• 2v (2u 2 - 1) = or Im(v) = 0. 

• 2u (2v 2 - 1) = or lm(u) = 

• 4mo«o = or lm(u) = or Im(tj) = 0. 

Suppose, Im{u) = 0. Then u§ = 1 and lm(v) ^ 0. So the second condition above forces 
v = 0. But then the first condition above is not satisfied. Similarly the condition Im(ti) = 
is untenable. Hence, Im(w) ^ ^ lm(v). Now the fourth and the first conditions together 
force u = = v . Conversely, when u = = v we see, from conditions above, that G 2 = I. 

Next, consider G 2 + aG + 1 = 0. Suppose neither lm(u) nor lm(v) is zero. Then, by 
comparing the lm(u) (g) Im(v) coefficient, it is necessary that a = —4u v . But if a = —4u v , 
then (by comparing the 1® 1 coefficient) we see Uq+v 2 = 1, while by comparing the coefficients 
of lm(u) <g> 1 and 1 ® lm(v), we find u = = v Q . Hence, necessarily precisely one of lm(u) 
or lm(v) is zero. 

Suppose Ira(u) = 0. Then u is +1 or —1. By absorbing the negative coefficient in v, we 
may suppose u = 1- In this case a further necessary condition is a = —2v - Similarly, if 
lm(v) = 0, a = —2uo is required. Conversely, these conditions are easily seen to be sufficient 
for p{x) = x 2 + ax + 1 to be the minimal polynomial of G. 

Next we study the necessary and sufficient conditions for G to have cubic minimal poly- 
nomials. First, we find that 

G 3 = u v (16ulv 2 - \2u\ - Ylv\ + 9)1 <S> 1 
v [20ulv 2 - 12ul - 6v 2 + 3]Im(u) ® 1 
u [20u^ - 12vl - 6ul + 3]1 ® lm(v) 
[161*0^0 - 4uq - 4wq + l]Im(u) ® lm(v) 

By Proposition (3.1) it suffices to consider when G 3 +aG 2 +aG+I = or G 3 -aG 2 +aG-I = 
for suitable constants a. 

Writing out the former we get 

/i(a, u , v )l ® 1 + / 2 (a, u wo)Iw(u) ® 1 + /s(a, u v )l ® Im(u) + / 4 (a, u fo)Im(u) (g)Im(ti) = 



for some polynomials fi,i — 1,...,4, whose explicit form we omit for brevity. Necessar- 
ily lm(u) ^ ^ Im(v) (for otherwise we would be in the case of lower degree minimal 
polynomials). Hence a necessary and sufficient condition for G 3 + aG 2 + aG + I = is 
fi(a, uo, v n ) = 0, i =,..., 4. These four equalities are equivalent to w = — «o- Clearly we need 
u = —v ^ to preclude G 2 = I. In this case, we also find a = —(I + 4m wo)- Similarly 
x 3 — ax 2 + ax — 1 is the minimal polynomial iff u = v a ^ and a — 4u «o — 1. 

Remark 3.5 While (II) of Proposition (3.1) applies to other groups of matrices such as 
symplectic matrices, finding quaternioninc representations for them is quite arduous, and the 
formulae for such representations are not nearly as succinct as those for matrices in 5*0(4, R). 
In [2], a quaternionic representation for 5p(4, i?) was obtained. In particular, this was used 
to find a closed form formula for the characteristic polynomial of such matrices. Extending 
this to find expressions for the minimal polynomial remains to be investigated. 

4 Illustrative Applications 

In this section we work out a few sample applications of the foregoing results. The first appli- 
cation shows that Jordan structure of skew-Hamiltonian matrices is determined completely 
by its minimal polynomial plus a single rank calculation (which can be performed in closed 
form). The second application works out the Cayley transform of skew-Hamiltonian matrices. 
Finally, we show how the minimal polynomial calculation of symmetric matrices can be used 
to determine the singular values of 3 x 3 real matrices. 
Jordan Structure of Skew-Hamiltonian Matrices: 

Proposition 4.1 Let W be a non-scalar, skew-Hamiltonian with quaternionic representation 
6(1 <S> 1) + p <g> j + 1 ® (ci + dk). Then W is diagonalizable iff || p ||V c 2 + d 2 . The 
Jordan normal form of W is either diag(b + fi, b + /i, b — /i, b — /i) or diag(J2{b), Jiib)). Here 
p, = y/\\p || 2 — c 2 — d 2 , and J2(b) stands for the standard 2x2 Jordan block with b as the 
corresponding eigenvalue. Finally the characteristic polynomial of W is p(x) = x 4 — Abx 3 + 
(6b 2 - 2i?)x 2 + (46^ 2 - Ab 3 )x + 6 4 + M 4 - 2 M 2 6 2 . 

Proof: First since W is non-scalar, the quantity 9 2 ~\\ p \\ 2 +c 2 + d 2 is non-zero. Per 
Theorem (3.4), W has minimal polynomial x 2 — 2bx + n, where k = b 2 — \\ p || 2 +c 2 + d 2 . This 



polynomial has roots (b + ix, b — /x), which are distinct iff /x ^ 0, whence the first conclusion. 
The algebraic multiplicity of both the roots, b + fi and b — (i, as roots of the characteristic 
polynomial has to be two each, for any other configuration of algebraic multiplicities would 
not yield Tr(W) — 46. This yields the stated characteristic polynomial. Note that when 
/x = 0, the sole eigenvalue is b with algebraic multiplicity four. 

Next, when /x ^ 0, W is diagonalizable and, in view of the algebraic multiplicities men- 
tioned above, the corresponding Jordan form is diag(b + /x, b + ix, b — ix, b — ix). 

When /x = 0, 6 is a two-fold root of the minimal polynomial. Hence the size of the largest 
Jordan block corresponding to the sole eigenvalue, b, has to be 2. Thus, the remaining Jordan 
blocks are either a single Jordan block of size 2 or two Jordan blocks each of size 1. To 
determine which possibility occurs, recall that for an n x n matrix W [7] 

rii = rj_i - 2n + r i+1 ,i = 0, 1, . . . ,n - 1 

where rn stands for the number of Jordan blocks of size i corresponding to a given eigenvalue 
A of W, and = rank(W — A/) fc , with the convention that r n+ \ = r n = n — v, with v being 
the algebraic multiplicity of A as a root of the characteristic polynomial. 

Let Y = W — bl. In view of the only possibilities for the Jordan form of W (when /x = 0), 
it is obvious that the rank of Y is either 1 or 2. From this we see that W has 2 Jordan blocks 
of size 2 each iff iank{Y) is 2, while it has one Jordan block of size 2 and two Jordan blocks 
of size 1 each iff vankiY) is 1. We will now show that only the former possibility can occur. 

This can be seen in a variety of ways. For instance, vk(Y) — rk(Y T Y) and the latter has 
rank 2 precisely when at least one of its 2 x 2 principal minors is non-zero. We will now show 
that at least one 2x2 principal minor My of Y T Y has to be non-zero. 

Since Y — p j + 1 ® (ci + dk), a simple calculation yields 

/ , \ 

9 2 - 2cp 3 + 2dp 1 2cp 2 -2dp 3 2dp 2 



Y Y = 



2cp 2 9 2 + 2cp 3 + 2dpx 2dp 2 2dp 3 - 2cpx 

-2cpi - 2dp 3 2dp 2 2 - 2dp x + 2cp 3 -2cp 2 

y 2dpi 2dp 3 — 2cpi —2cp 2 9 2 — 2dp x — 2cp 3 j 



(4.8) 

Here, as before, 9 2 =\ \ p \ \ 2 +c 2 + d 2 . We will now show that even 5 of the principal minors 
being zero leads to the contradiction 9 2 = 0. 



Specifically suppose 

• (6 2 + 2d Pl ) 2 - Ac 2 p 2 - Ac 2 p 2 = ((1, 2) minor). 

• 6> 4 - (2cp 3 - 2d Pl ) 2 - (2cpi + 2dp 3 ) 2 = ((1, 3) minor). 

• (6 2 - 2cp 3 ) 2 - Ad 2 p\ - Ad 2 p\ = ((1,4) minor). 

• (6 2 + 2cp 3 ) 2 - Ad 2 p\ - Ad 2 p\ = ((2, 3) minor). 

• (6 2 - 2d Pl ) 2 - Ac 2 p 2 3 - Ac 2 p 2 2 = ((3,4) minor). 

Now the above facts regarding the (1, 2) and (3, 4) minors are equivalent to dpi = (since 
2 7^ 0). Similarly the facts about the (1, 4) and (2, 3) minors are equivalent to cp 3 = 0. Hence 
these last two facts used in the (1, 4) and (1, 2) minor say 9 4 = Ad 2 p\ = Ac 2 p\. This means d 
and c are non-zero. Hence, necessarily p\=p 3 = 0. Using this last piece of information in the 
fact concerning the (1,3) minor gives 8 4 = - a contradiction. Hence rk(Y T Y) — rk(Y) = 2. 

Cayley transform of skew-Hamiltonian matrices: The Cayley transform of matrices 
provides a relationship between matrix Lie groups and their Lie algebras. It is interesting 
to compute it even for matrices not belonging to a Lie algebra. We do this below for skew- 
Hamiltonian matrices. 

Let "4>c(A) = (I — A) (I + A)^ 1 be the Cayley transform of A which is assumed to be 4 x 4 
skew-Hamiltonian. Since ipc(A) is not defined if —1 is an eigenvalue of A, we suppose that b 
equals neither — (1 + fi) nor —1 + fi. We know from the results above that this ensures that 
— 1 is not in the spectrum of A. 

Since A's minimal polynomial is quadratic, we know ipc(A) = cqI + C\A, with c , c\ some 
constants. So we get 

c x A 2 + (ci + c + 1)A + (c - 1)1 = 
and hence, in view of the minimal polynomial of A, 

(2bd + ci + c + I) A + (co - kci -1)1 = 

This leads to the following system of equations for c ,ci: 

c - kc\ = 1 (4.9) 
c + (2fe+l)ci = -1 



This yields c = §£±i^ and c x = 
Hence, 

. ... 26+l-/s_ -2 

^ ) = 2TTTT^ /+ 2TTTT^ A 

Singular values of 3 x 3 real matrices One can use the geometric characterizations of 
minimal polynomials of traceless 4x4 real symmetric matrices to infer information about 
the singular values of real 3x3 matrices. This follows from the results in [11], wherein the 
eigenvalues of the symmetric matrix X — p®i + q® j + r®k are related to the singular values 
of the real matrix Y = [p \ q \ r]. Thus, if u\ > o 2 > 03 are the singular values of Y and 
r = sgn detY (t = if Y is singular), then the eigenvalues of X are Ai = o\ + a 2 + T03, A 2 = 

(Jl — 0"2 — T(73, A3 = U 1 + <T2 — T(T3, A4 = U 1 — (72 + TCT3. 

From this expression and the fact that AT's minimal polynomial has to have distinct roots, 
one can infer the following relation between X's minimal polynomial, and therefore the cor- 
responding geometric conditions on p, q, r stated in Theorem (3.5), and Y's singular values: 

• (T2 = — £73, (Ti 7^ Oiff X has minimal polynomial x 2 — c 2 . 

• (7i = (7 2 7^ and cr 3 = iff X has minimal polynomial x 3 + cx. 

• cti = 02 = 03 7^ iff X has minimal polynomial x 2 — 2lx — A 2 . 

Remark 4.1 The above list only contains those statements regarding the singular values 
of Y = [p, q 7 r] corresponding to the list of minimal polynomials in Theorem (3.5). One 
can also infer the following statements regarding the singular values of Y by invoking the 
diagonalizability of X. Alternatively, the statements below about the singular values of Y 
can be used to augment the list of minimal polynomials of X. The corresponding conditions 
on p, q } r are too cumbersome to state. Partly because of this, and partly since that would 
have been contrary to the spirit of the paper, these minimal polynomials were not presented 
in Theorem (3.5) (cf, the note, immediately following the statement of Theorem (3.5) and 
Remark (3.4). 

• If Y has rank 2 and a\ ^ 02 then X has a quartic minimal polynomial. 

• Y has a cubic minimal polynomial other than x 3 +cx iff t ^ and either i) u\ = <t 2 ^ as. 
In this case no eigenvalue of X is zero; or ii) <r 2 = 03 ^ <?i- In this case X has a zero 



eigenvalue iff o\ = 2iT2; 



5 Extensions 

There are a few potential extensions of this work which we will discuss in this section. 

One trivial way to extend the above results is to consider block diagonal matrices, with 
each block 4x4. The minimal polynomial of such a matrix is the least common multiple of 
the minimal polynomials of the individual blocks. Thus, when each of these blocks belongs 
to any of the classes of matrices considered here, one can find in closed form their minimal 
polynomials. 

A second extension is to apply the theory of Clifford Algebras to calculate minimal poly- 
nomials, since each Clifford algebra arises as a suitable matrix algebra. In this regard we 
mention the interesting work of [1], where a symbolic calculation of the so-called real mini- 
mal polynomial is used to calculate exponentials of matrices. This, however, does not take 
into account the involutions of Clifford algebras, and thus the structure of the matrix is not 
used in finding minimal polynomials. In particular, there are no analogues of the geometric 
conditions on quaternions in the previous sections. 

To understand the crux of the differences between our work and that in [1] , it is useful to 
note the three features of H <g> H which enable our approach : 

• i) H <g) H has a basis in which every element squares to plus or minus 1. Furthermore, 
any two elements in this basis commute or anti-commute. 

• ii) The matrix analogue of the natural conjugation on H ® H is matrix transposition. 

• iii) The multiplication in H ® H is intimately related to the geometry of vectors in R 3 . 

For Clifford algebras the first feature goes through verbatim. The second feature's effect 
is somewhat diluted, inasmuch as the natural involutions of the theory of Clifford Algebras 
(Clifford conjugation and reversion), [10, 13], have easy matrix theoretic interpretations only 
in certain cases. Finally, the third feature is completely lost. In the work of [1], only the first 
feature is used. Hence the structural (i.e., geometric) conditions in this work on a matrix's 
H <g> H representation, for it to have a specific minimal polynomial, have no analogues in [1]. 



As mentioned in the previous paragraph, the three enabling features for the H®H isomo- 
prhism of M(4, R) are diluted for Clifford algebra isomorphisms of matrix algebras. Neverthe- 
less, there are two ways in which the theory of Clifford algebras can be used for the purpose 
at hand. First, one can uncover more classes for 4 x 4 matrices whose minimal polynomials 
can be calculated, and whose Jordan structure is akin to those of skew-Hamiltonian matrices. 
This is achieved by first considering matrices in M(4, R) as elements of suitable Clifford al- 
gebras and inspecting their behaviour under Clifford conjugation and/or reversion, and then 
representing such matrices via quaternions. In some cases the H ® i? representations of these 
matrices enables a complete characterization of their minimal polynomial. Arguably, one 
would have not been lead to consider these classes otherwise. Secondly, one can use Clifford 
algebra representations of matrices of larger size to give a partial characterization of their 
minimal polynomials. Whilst a complete characterization of possible minimal polynomials 
for such matrices is the subject of future work, one can already say more than what would 
be possible without using Clifford algebras. 

Let us now explore the first extension. To that end, note that there are two standard 
involutions in the theory of Clifford algebras - reversion and Clifford conjugation., [10, 13] 
These are both anti-automorphisms. The matrix versions of these two involutions are easy 
for two classes of Clifford algebras. For Cl(n,0), reversion is Hermitian conjugation, while 
for C7(0, n) Clifford conjugation is Hermitian conjugation. Representing Cl(p + l,q + 1) as 
M(2, Cl(p, q)) (the algebra of 2 x 2 matrices with entries in Cl(p, q)), it is known that Clifford 
conjugation is represented as follows 



\ 



cc 



while reversion is 



A B 
C D 



' A B ^ 



C D 



jjrev _j^rev 
(Jrev j^rev 



( \ 

D cc B cc 



\ 



C cc A c 



J 



Here, and in the balance of this section, Z cc (respectively, Z rev ) stands for the Clifford 
conjugation (respectively, reversion) of a matrix (or its Clifford representation) Z. 

Let us illustrate how this can be used to find minimal polynomials for matrices stemming 
C7(2,2). Since C/(l, 1) is M(2,R), it follows that C/(2,2) is M{A,R). On Cl(l, 1) reversion 



\ c D J 



sends X to i? 2 A T i? 2 (which, in the notation introduced in Section 2, is Xp), while Clifford 
conjugation sends X to —JiX T Ji, which is Xh- Equivalently, since X is 2 x 2, X cc is 
adj(X), where, as usual, adj(X) is the classical adjugate of X. Thus, on C7(2, 2) we get 

/ \ TCV , , 

Cjj Ah j 

Thus, if A G C7(2, 2) equals its own reversion, then A and D are each other's adjugates, while 
B and C are 2x2 skew-Hamiltonian. A basis of 1-vectors for (7/(2, 2) consists of the following 
four matrices (written in H ® iJ form) : 

/! = -j <g> fc, / 2 = i <g> fc, / 3 = -1 <g> *, .A = -1 (8) j (5.10) 

This yields expressions for 2- vectors etc., which we omit. Now, since X equals its own reverse, 
it must be a linear combination of the identity, 1-vectors and 4-vectors. The last equation, 
for a basis of one- vectors for Cl(2, 2), thus yields the following H®H representation of of the 
most general X G Cl(2, 2) satisfying X rev = X. 

a(l®l)+p®k + l®s (5.11) 

withp, s pure-quaternions, with the latter having no k- component. Thus such an X is remark- 
ably similar to skew-Hamiltonian matrices, with the difference that the roles of j and k have 
been interchanged. Thus, we find, for instance that X 2 = (p.p — s.s — a 2 )l ® 1 + 2aX and 
hence such an X's minimal polynomial is quadratic. We omit the similar statements about 
the Jordan structure of such matrices, that this minimal polynomial yields. If X G CZ(2, 2) 
satisfies X rev = —X, then it has an H ® H representation akin to that for a Hamiltonian 
matrix, and therefore an analogue of Theorem (3.2) applies to it. 

Similarly, if X G C7(2,2) is minus its own reversion, then i) A = D F and B and C are 
both perskewsymmetric; and ii) the H ® H representation of X is given by 

X = a(l ® 1) + k ®p + q® 1 (5.12) 

with a G TZ, p, q G V and q.k = 0. Once again, this yields a quadratic minimal polynomial for 
X. 

Remark 5.1 • A calculation shows that matrices represented by Equation (5.11) [resp. 
Equation (5.12)] are precisely those self- adjoint with respect to the non-degenerate bilin- 



ear form on R 4 whose denning matrix is Mi®k (resp. Mk®i). Thus, the considerations 
of the previous paragraphs yield a natural Clifford theoretic interpretations for these 
bilinear forms. 

• By passing to Cl(3, 1) and performing an analysis akin to the one above for C7(2,2) 
one can show that those X € Cl(3, 1) satisfying X rev = X are again given by Equation 
(5.11), while those satisfying X cc — X are skew-Hamiltonian matrices. 

• It is worth emphasizing that the block structures of the matrices considered in the pre- 
vious paragraphs do not themselves reveal the simplicity of their minimal polynomials. 
It is only by passing to their H (g H representations that we are lead to these results. 

For higher dimensional matrix algebras arising from Clifford Algebras, we do not (yet) have 
an exhaustive set of results. Nevertheless, some conclusions can be drawn, which would have 
been difficult to arrive at without passing to Clifford Algebras. Let us illustrate this via 
C7(0, 6). This is M(8, R). Furthermore, Clifford conjugation is precisely matrix transposition 
in this case and thus a matrix is anti-symmetric iff it is minus its Clifford conjugation. Since 
Clifford conjugation of a p- vector in C7(0, 6) is minus itself iff p = 1, 2, 5, 6, an 8 x 8 matrix is 
anti-symmetric iff it is a linear combination of of these p-vectors. We use the following basis 
of 1-vcctors: 

d = a z <g> a x a z <g h (5.13) 

e 2 = o z o x (g J 4 

e 3 = a x ® a z a x <8> a x 

e 4 = a x ® a z a x <8> a x 

e 5 = a x (g) h <S> cr x a z 

e e = o z <g a x ® a z o x 

Here the cr's are the usual Pauli matrices. Using this one can write down a basis of p-vectors 
for p — 2, 5, 6 which we omit for brevity. The typical 8x8 anti-symmetric matrix is thus a 
real linear combination 
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X = y^ffie, + y^.PijCij + X! Pijklmeijklm + Pl23456ei23456 ( 5 -14) 
i=l »<j i<j<k<l<m 



One can now list a set of mutually exclusive conditions on these coefficients which are necessary 
and sufficient for X to have a quadratic minimal polynomial. From Proposition (3.1) we 
know that the minimal polynomial has to have the form p(x) = x 2 — A 2 . Due to the more 
complicated structure of Clifford multiplication on Cl(0, 6) this list of conditions, even for 
the quadratic case, are far too long to enlist. Therefore, we will just give sample instances of 
these conditions. 

To that end, it is first noted that this set contains conditions of two types. The first 
consists of conditions which merely equate some of the coefficients, pj, J C {1,2, 3, 4, 5, 6} in 
Equation (5.14) to zero. The latter consist of more complicated algebraic relations between 
the pj. To understand the difference between the two, it is first noted that a p-vector and a 
g-vector either commute or anti-commute. Conditions of the first type arise precisely when 
all the summands in Equation (5.14) anti-commute. Under these circumstances the minimal 
polynomial of X is clearly quadratic. The latter set of condition arises when there are some 
commuting summands in Equation (5.14). In this case the corresponding coefficients have to 
satisfy certain relations to ensure that p(x) — x 2 — A 2 is the minimal polynomial of X. By 
carefully considering the commutation relations between the 1,2,5 and 6-vectors in C7(0, 6) 
one can arrive at the aforementioned conditions. 

Enlisted below are instances, first of the first type of conditions and then of the second 
type of conditions. 

• i) X = piCi + J2k<iPki e kiJ2 1> iPij e i] + Pa/3 7 8ee a i3ySe, with i £ {a, /3, 7, 8, e}. 

• ii) X =p i e l + p af 3ySee a i3j8e +Pi23456ei23456, with i g {a, j3, 7, 6, e}. 
Examples of the second type of conditions are 

• X = pid +p 2 e 2 +P13613 +P23C23 with P1P23 = P2P13- 

• X = piei +P23e23 + P45C45 + Pi2345ei2345 with p x = p 4 5,P23 = P12345 and pi equal to 
P23 upto sign. 

Finally, in all the cases above A 2 is the Euclidean length squared of the vector of coefficients 
describing X. 

Octonions and Quadratic Minimal Polynomials: One special class of 8 x 8 matrices which 



always have quadratic minimal polynomials can be obtained via octonions. Whilst the octo- 
nions are not associative, one can attach two 8x8 matrices, cu(a), 9(a) to an octonion a, [17]. 
The former describes the effect on an octonion upon left multiplication by a, while the latter 
does the same for right multiplication by a. To describe them express the octonion a by a 
pair of quaternions, a = (01,02), a, G H, i = 1,2, via the Cayley doubling procedure, [10]. 
Then 



tu(a) 



/ \ 



and 

( „ \ 



0(a) = 



M a2 ®i Mi(g, ai 



Here — diag(l, —1, — 1, —1). 

Then, as shown in [17], the alternating identities yield 

uj(a 2 ) = (uj(a)) 2 (5.15) 
0(a 2 ) = (9(a)) 2 

Now since any octonion a satisfies a 2 — 2Ke(a)a+ \ a | 2 = 0, we see that the 8x8 matrices 
ui(a) and 9(a) have the quadratic polynomial p(x) = x 2 — 2Re(a)a;+ | a | 2 (as long as o ^ 0. 
Next, since the octonions are not associative one cannot expect Lo(ab) (resp. 9(ab)) to equal 
the product uj(a)uj(b) (resp. 9(a)8(b)). Nevertheless, if ab ^ 0, uj(ab) (resp. 9(ba)) is similar 
to uj(a)oj(b) (resp. 9(a)9(b)), [17]. Thus, the matrix lo (a) u (b) (resp. 9(a)9(b)) has a quadratic 
minimal polynomial, p(x) = x 2 — 2Re(a6)a;+ | ab | 2 = x 2 — 2 < a, b > x+ \ a \ 2 \ b \ 2 (resp. 
q(x) = x 2 - 2Re(6a)a;+ | ba \ 2 ). 

This is significant since the structure of the matrices oj(a)ui(b) (resp. 8(a)9(bj) is more 
complicated than that of lj(c) (resp. 8(c)), for an octonion c. 

We end this section with a discussion of how the method of [8] can be combined with 
those of this work to compute minimal polynomials of 4 x 4 matrices not covered above. The 
same discussion will also reveal why the method of [8] requires more computation than that 
proposed here. 

We first briefly recall the method of [8] for computing the minimal polynomial of a matrix 
X of size n x n. One first associates to the sequence {/, X, X 2 ,...} the matrices Gi,i = 



1, . . . , n, where Gi is the Gram matrix of the set of matrices {/, X, X 2 , . . . , X 1 } with respect 
to the inner product < Y, Z >= Tr(Y T Z) (here, for brevity, all matrices are assumed to be 
real). Thus, for instance 

Tr(J) Tr(X) Tr(X 2 ) 



G-2 = 



Tr(X T ) Tr{X T X) Tr(X T X 2 ) 



^ Tr{(X T ) 2 ) Tr((X T ) 2 X) Tr(X T ) 2 X 2 ) J 
The method then, in essence, consists of two steps: 

• One computes the ranks of the Gi's. Then the degree of the minimal polynomial of X 
is r iff the first i for which the rank of Gi is lower than i + 1 is r. 

• In this case it is also known that the kernel of G r is of dimension one. Furthermore, it 
is guaranteed that there is a vector in the kernel of G r whose last coefficient is non-zero. 
Normalizing this coefficient to one yields a vector (do, a\, . . . , a r _i, 1) in the kernel of 
G r . This vector yields the minimal polynomial of X to be p(x) = x r + Y^iZo a i x% ■ 

Thus, this method requires two steps i) Calculating the Gi and their ranks successively till 
one detects a drop in rank. Thus, this step requires a requisite number of trace calculations 
plus one's favourite method to compute ranks; ii) Computing a non-zero vector in the kernel 
of G r . 

The first step is amenable to the methods used in this work, since to find the trace of 
a matrix being represented in quaternion (or Clifford Algebra) form, one has to only find 
the coefficient of the 1 Cg) 1 term in the matrix. This rarely requires the full quaternionic 
expansion of the matrix. However, even for the classes of structured matrices considered 
here, these calculations involve more than those required by our methods. We illustrate this 
issue via the case of 4 x 4 real symmetric matrices. To detect a quadratic minimal polynomial, 
our method requires finding only X 2 . However, to find G3 and check if its rank is two, one 
needs terms such as Tr(X 3 ). While, this does not require the full calculation of X 3 , it requires 
more than a calculation of X 2 , because one has to find the 1 ® 1 term in X 3 . 

Even when the ranks of the Gi have been computed and the degree of the minimal poly- 
nomial found, one has to still find a non-zero element of the kernel of G r . This is typically 
difficult to do in closed form, whereas the methods used here do produce the minimal poly- 



nomials (for the classes of matrices considered here) in closed form. 

6 Conclusions 

In this work a complete characterization of the minimal polynomials of several important 
classes of 4 x 4 real matrices, including those of interest in applications, was provided. These 
were illustrated by relevant applications such as the determination of the Jordan structure of 
4x4 skew-Hamiltonian matrices. Extensions of these results via the usage of Clifford algebras 
was indicated. In particular, classes of matrices were found whose block structures bely 
their close similarity, vis a vis minimal polynomials, to skew-Hamiltonian and Hamiltonian 
matrices. Extensions of the preliminary results announced here for M 8 (R) will be the subject 
of future investigations. 
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